# End-to-end speech recognition

Faster Whisper Base.en
MIT
This is a Whisper base.en model converted based on CTranslate2, used for English speech recognition tasks.
Speech Recognition English
F
Systran
367.44k
4
Assignment1 Joane
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR)
Speech Recognition Transformers English
A
Classroom-workshop
22
0
Assignment1 Jack
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture
Speech Recognition Transformers English
A
Classroom-workshop
24
0
Assignment1 Jane
MIT
s2t-small-librispeech-asr is a speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture.
Speech Recognition Transformers English
A
Classroom-workshop
29
0
S2t Medium Librispeech Asr
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture
Speech Recognition Transformers English
S
facebook
1,086
9
S2t Small Librispeech Asr
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture
Speech Recognition Transformers English
S
facebook
10.92k
27
Wav2vec2 Marathi Stt
This is a Marathi speech recognition model based on the Wav2Vec2 architecture, capable of directly converting speech to text.
Speech Recognition Transformers
W
addy88
30
0
Kamo Naoyuki Mini An4 Asr Train Raw Bpe Valid.acc.best
This is an automatic speech recognition (ASR) pretrained model based on the ESPnet2 framework, trained on the mini-an4 dataset and supports English speech recognition.
Speech Recognition English
K
espnet
425
1
Asr Wav2vec2 Commonvoice Rw
Apache-2.0
This is an end-to-end model for automatic speech recognition in Rwandan, based on the wav2vec 2.0 pre-trained model combined with CTC and attention mechanisms, fine-tuned on the CommonVoice dataset.
Speech Recognition Other
A
speechbrain
28
1
S2t Large Librispeech Asr
MIT
An end-to-end sequence-to-sequence transformer model for automatic speech recognition (ASR), trained on the LibriSpeech dataset
Speech Recognition Transformers English
S
facebook
422
10
Wav2vec2 Base Turkish Cv8
This is an automatic speech recognition (ASR) model fine-tuned on the Common Voice 8.0 Turkish dataset, capable of converting Turkish speech into text.
Speech Recognition Transformers Other
W
cahya
16
1
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase